Dynamic Feature Extraction from Molecular Mechanics Trajectories

نویسندگان

  • Han Altae-Tran
  • Muneeb Sultan
  • Vijay Pande
چکیده

With the recent advancements in distributed computing, it is now possible to simulate molecular dynamics of large systems over unprecedented time scales. However, due to the sheer size of the generated trajectories, it is no longer possible to reliably extract relevant chemical and biological information from a simulation through visual inspection alone a single trajectory alone may consist of well over a thousand of atoms, charted over hundreds of time points. Currently, Markov state models are being used to compute a discretization of the conformational space of large proteins. However, one current challenge lies in understanding the transitions between the states. To address this problem, we consider restoring the dynamic systems perspective and look for methods capable of detecting transition mechanisms in this context. Since physical interactions between atoms are typically proportional to distances between corresponding atoms, we are particularly interested in analyzing pairwise atomic distances. Unfortunately, the space of pairwise distances scales with the number of atoms squared. It is therefore desirable to develop methods that reduce the dimensionality of the space into readily interpretable coordinates that largely account for variations in the data. While offering a promise of fewer dimensions, conventional methods, such as Principal Components Analysis (PCA), often produce difficult to interpret coordinates, as each of the coordinates mixes a significant number of atoms that makes it difficult to definitely attribute motion along a principal component to motion in real space. Furthermore, the assumption of linearity is weak in the context molecular dynamics. Similar problems arise for factor analysis. Finally, many of these techniques are not readily applicable to time series data because measurements taken close in time cannot be considered independent samples from the distribution of atomic configurations. Here we seek to address this issue by proposing an unsupervised feature selection method for a system of time series that incorporates basic time dependent relationships within the system. For the application, we will be working with pairwise distances from single molecular dynamics trajectories; however, this method should be applicable to other feature spaces and possibly other fields.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بازشناسی دانش فرد خطاکار به کمک سیگنال‌های تک ثبت مغزی، مبتنی بر روش غیرخطی منحنی‌های بازگشتی

In this study, Recurrence Plots (RPs) has been adapted as a nonlinear approach in order to detection of guilty subjects’ knowledge based on their single-trial ERPs. The dataset were acquired from 49 human subjects who were participated in a Concealed Information Test (CIT). According to the test protocol, guilty subjects denied their information about familiar faces, so the aim was to detect th...

متن کامل

Modeling Dynamics with Deep Transition-Learning Networks

Markov processes, both classical and higher order, are often used to model dynamic processes, such as stock prices, molecular dynamics, and Monte Carlo methods. Previous works have shown that an autoencoder can be formulated as a specific type of Markov chain. Here, we propose a generative neural network known as a transition encoder, or transcoder, which learns such continuous-state dynamic pr...

متن کامل

Feature Engineering for Map Matching of Low-Sampling-Rate GPS Trajectories in Road Network

Map matching of GPS trajectories from a sequence of noisy observations serves the purpose of recovering the original routes in a road network. In this work in progress, we attempt to share our experience of feature construction in a spatial database by reporting our ongoing experiment of feature extraction in Conditional Random Fields (CRFs) for map matching. Our preliminary results are obtaine...

متن کامل

Theoretical study of the temperature dependence of dynamic effects in thymidylate synthase.

A theoretical study of the temperature dependence of dynamic effects in the rate limiting step of the reaction catalyzed by thymidylate synthase is presented in this paper. From hybrid Quantum Mechanics/Molecular Mechanics (QM/MM) optimizations of transition state structures within a fully flexible molecular model, free downhill molecular dynamics trajectories have been performed at four differ...

متن کامل

Assessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories

In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013